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Abstract 

If a quantum system A, which is initially correlated to another system, E, under- 
' (— | '■ goes an evolution separated from E, then the correlation to E generally decreases. 

Q_[ Here, we study the conditions under which the correlation disappears (almost) com- 

pletely, resulting in a decoupling of A from E. We give a criterion for decoupling 
in terms of two smooth entropies, one quantifying the amount of initial correlation 
between A and E, and the other characterizing the mapping that describes the evo- 
lution of A. The criterion applies to arbitrary such mappings in the general one-shot 
setting. Furthermore, the criterion is tight for mappings that satisfy certain natural 
conditions. Decoupling has a number of applications both in physics and informa- 
tion theory, e.g., as a building block for quantum information processing protocols. 
As an example, we give a one-shot state merging protocol and show that it is essen- 
tially optimal in terms of its entanglement consumption/production. 



1 Introduction 



Correlations in quantum systems, and in particular entanglement, have been in the focus 
of (both theoretical and experimental) research in quantum information science over the 
past decades. As a result, one has nowadays a pretty good (although still not complete) 
understanding of quantum correlations and, in particular, the processes that create them. 
In this work, we take — so to speak — an opposite approach and study conditions under 
which two systems can be decoupled, i.e., brought to a state where they are uncorrelated. 

We call a system, B, decoupled from another system, E, if the joint state of the two sys- 
tems, pbe, has product form ps® Pe- Operationally, this means that the outcome of any 
measurement on B is statistically independent of the outcome of any measurement on 
E. Or, in information-theoretic terms, the system E does not give any information on B 
(and can therefore safely be ignored when studying B). 



Decoupling Theorem. Our goal is to characterize the conditions under which the evo- 
lution of a system results in decoupling. For this, we consider a system, A, that may 
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Figure 1: Decoupling. The initial system, A, may be correlated to a reference system E. The evo- 
lution is modeled as a mapping T from A to -B. The final state of B is supposed to be independent 
of E. The subdivision of T into a unitary W and a mapping T is required for the formulation of 
our decoupling criterion. 

initially be correlated to E. Furthermore, we assume that the system A undergoes an 
evolution, described by a TPCPM 1 T from A to B, during which no interaction with E 
takes place (see Fig. 1). The main result of this work is a decoupling theorem, i.e., a crite- 
rion that provides necessary and sufficient conditions for decoupling (of B from E). The 
criterion depends on two entropic quantities, characterizing the initial state, pae, and the 
mapping T, respectively. 

The decoupling criterion can be conceptually split into two parts, called achievability and 
converse part, which we now describe informally. The full technical statements are pro- 
vided as Theorems 3.1 and 4.1 in Sections 3 and 4, respectively. For their formulation, it 
is convenient to view T as a sequence, T = T °U, where U is an arbitrary unitary on A, 
and T a fixed TPCPM from A to B. 

Achievability: Decoupling is achieved for most choices ofU if H^ ain (A\E) p + H^ ain (A\B) T ^ 0. 

Converse: Decoupling is not achieved for any choice ofU if H^ ahl (A\E) p + H^ xax (A\B) T ^ 0. 

The criteria refer to the smooth min- and max-entropies introduced in [RW04, Ren05], which 
can be seen as generalizations of the von Neumann entropy (cf . Section 2 for definitions 
and properties). The smooth min-entropy H^ lin (A\E) p is a measure for the correlation 
present in the initial state pae — the larger this measure, the less dependent is A on E 
(see Table 1 for some typical examples). The quantities H^ lin (A\B) T (for the achievability) 
and H^ naK (A\B) T (for the converse) measure how well the mapping T conserves correla- 
tions. Roughly, they quantify the uncertainty one has about a "copy" of the input, 2 A, 
given access to the output, B, of T (cf. Table 2). We note that the expressions for the 
achievability and for the converse essentially coincide in many cases of interest (see the 
discussion in Section 4). 

As a typical example, consider m qubits, A, that are classically maximally correlated 
to E (so that H^^AlE) = 0, cf. second row of Table 1). Furthermore, assume that A 
undergoes a reversible evolution, U, after which we discard m — m' qubits, corresponding 
to a partial trace, T = Tr m _ m / (see last example of Table 2). Our criterion then says that 
the remaining m' qubits will, for most evolutions U, be decoupled from E whenever 
m! < m/2. Conversely, if this condition is not satisfied, some correlation will necessarily 
be retained. 

: A Trace-Preserving Completely-Positive Map (TPCPM) is a linear function that maps density operators to 
density operators. 

2 More precisely the entropy is evaluated for the state tab obtained by applying T to one half of an 
entangled state. 



2 



Description of initial state 
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Table 1: Dependence on the initial state. The table illustrates how the term H^ nin (A\E) p (for 
e —* 0) in the decoupling criterion depends on the initial state pae- In all three examples, A 
is assumed to be a fc-qubit system with orthonormal basis {|i)^}f =1 . Similarly, {|i)s}^ =1 is an 
orthonormal family of states on E. 

Applications. The notion of decoupling has various applications in information theory 
and in physics. Many of these applications have in common that decoupling of a system 
B from a system E is used to show that B is maximally entangled with a complementary 
system, R. Indeed, under the assumption that R is chosen such that the joint state, pber, 
is pure, pbe = Pb®Pe immediately implies that there exists a subsystem R' of R such that 
the state on pbr' is pure. If, in addition, ps is fully mixed, pbr' is necessarily maximally 
entangled. 

In the context of information theory, this type of argument is, for example, used to an- 
alyze state merging [HOW05, HOW07], i.e., the task of conveying a subsystem from a 
sender to a receiver — who already holds a possibly correlated subsystem — using clas- 
sical communication and entanglement. Another example, where decoupling is used in 
a similar fashion, is the Quantum Reverse Shannon Theorem [BSST02, BDH+09, BCR11]. In 
fact, the proof of this theorem given in [BCR11] refers to a coherent form of state merging 
(also known as the Fully Quantum Slepian Wolf or Mother Protocol [ADHW09]) where the 
classical communication is replaced by quantum communication. Decoupling can also 
be used for the characterization of correlation and entanglement between systems, era- 
sure processes, as well as channel capacities (see, e.g., [GPW05, Bus09, HHWY08]). In 
addition, its classical analogue, Privacy Amplification [BBCM95, RK05], is widely used in 
classical and quantum cryptography. 

Decoupling processes are also crucial in physics. For example, the evolution of a ther- 
modynamical system towards thermal equilibrium can be understood as a decoupling 
process, where the system under consideration decouples from the observer (somewhat 
analogous to the considerations in [LPSW09, Par89a, Par89b]). Recent work indeed sug- 
gests that there is a close relation between smooth entropies and quantities that are rel- 
evant in thermodynamics [DRRV09, dRAR + ll]. Similarly, black hole radiation may be 
analyzed from such a point of view [HP07, BP07, BZ09]. 

History and Related Work. While various standard results in quantum information 
theory have been proved using ideas related to decoupling, the concept came into its 
own with the discovery of State Merging protocols [HOW05, HOW07] and, later, the Fully 
Quantum Slepian Wolf protocol [ADHW09]. These are based on specific decoupling pro- 
cesses where the mapping T is either a projective measurement or a partial trace. In this 
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Description of mapping 
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Table 2: Dependence on the mapping. The table illustrates how the term H^ lill (A\B) T (which in 
these examples coincides with Hf nliK (A\B) T ) in the decoupling criterion depends on the mapping 
T. In all five examples, the input space, A, is assumed to consist of to qubits with orthonormal 
basis {|i)A}f=i- The last two examples have a smaller output space consisting of only m' qubits. 
The penultimate one can be seen as a combination of the first and the second, and the last one can 
be seen as a combination of the first and the third. (The smooth min-entropies are evaluated for 
e ->■ 0.) 

early work, the decoupling was analyzed in terms of the dimensions of certain subsys- 
tems (rather than smooth entropies). 

These decoupling results have been generalized in [WR09, Ber08] to include mappings T 
that consist of combinations of projective measurements and partial traces. Furthermore, 
in this work, the criterion has been expressed in terms of smooth entropies. Indepen- 
dently of this, a general decoupling theorem that can be applied to any type of mapping 
has been developed [Dup09]. This result is essentially (up to the use of different entropy 
measures) equivalent to Theorem 3.1 presented here. We also note that the aforemen- 
tioned characterizations of decoupling can be seen as special cases of this general result. 

The above work was mostly concerned with achievability. Converse results were so far 
only known in special cases. In particular, in [BRW07] and [Ber08] (see also [Ren09]) 
converse theorems have been derived for the case where the mapping T is a projective 
measurement. The converse theorem presented here, Theorem 4.1, generalizes these re- 
sults. 

Structure of the Paper. In Section 2 we introduce the notation and review the defini- 
tions and main properties of the entropy measures used in this work. Our main achiev- 
ability result for decoupling is given in Section 3, whereas Section 4 contains a converse 
that is tight in many cases of interest. The use of the decoupling technique is illustrated 
in Section 5, where we show how to obtain optimal one-shot quantum state merging. We 
conclude with a discussion in Section 6. 
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2 Preliminaries 



2.1 Notation 

We denote the Hilbert space associated to a system A by Ha- We only consider finite- 
dimensional systems and denote the dimension of Ha by \A\. The set of linear operators 
on % is denoted by C{H) and the set of nonnegative operators on H by V(H). We define 
the sets of subnormalized states S^iji) = {p e V(H) : Tr p < 1} and normalized states 

S = (H) = {peV{H) :Trp = 1}. 

The tensor product of T-L a and H b is denoted by Hab = Ha ®Hb- For multipartite op- 
erators pab £ V(Hab)> we write pa = Ttb(pab) for the corresponding reduced operator. 
For Ma e C(Ha), we write Ma = Ma ®1b for the enlargement on any Hab, where 1 b 
denotes the identity in V{Hb)- Isometries from Ha toHs are denoted by Va— B- 

Completely positive maps from C{Ha) to C{Hb) are called CPMs and trace-preserving 
CPMs are called TPCPMs. For H A , U B with orthonormal bases {K>a}|=i, {K>s}if i and 
\A | = \B\, the canonical identity mapping from C{Ha) to C{Hb) with respect to these 
bases is denoted by 2a— b, i.e., 2a— b( 1 00 U) = I 001b- 

For /? e V(H), \\p\\x> denotes the operator norm of p, which is equal to the maximum 
eigenvalue of p. The trace norm of p e V(H) is defined as ||p||i = Tr(ypfy) an d the 
induced metric on S<^(H) is called trace distance. 3 The fidelity between p,a e S<^(H) is 
defined as F(/>, <r) = H-v/pv^lli- 

We will make use of the Choi-Jamiolkowski isomorphism, which relates CPMs to positive 
operators, and which we denote by J. 

Lemma 2.1. [Jam72, Cho75] The Choi-Jamiolkowski map J takes maps T A ^ B : C(Ha) — * 
C(Hb) to operators J(T A ^ B ) e C(Ha> ® Hb), where Ha 1 = Ha- It is defined as 

J(J A - B ) = {lA>®T A - B )(\<5>)m A >A) , 

where |<&)a'a = Yli N)a' ® 10a- 4 TTze map J bijectively maps the set of CPMs from Ha to 
Hb to the set V{Ha> ® Kb), and its inverse maps any tab £ V{Ha> ®Hb) to 

T A - B : M A ~ |4| • Tr A (t AB mJ) , 
a>/zere Mj denotes the transpose of Ma with respect to the basis {|0a}[=i- 

2.2 Smooth Entropies 

The smooth entropy formalism [Ren05, RW04] has been introduced in (classical and 
quantum) information theory to study general one-shot scenarios, in which nothing needs 
to be assumed about the structure of the relevant probability distributions or quantum 
states (e.g., those modeling noise processes in a communication channel). The formalism 

3 The trace distance is often defined with an additional factor |, which we omit here. 
4 The Choi-Jamiolkowski isomorphism is sometimes defined with an additional dimensional factor of \A\; 
we choose not to do this here. 
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therefore overcomes a limitation of the established theory, where it is usually assumed 
that the relevant processes can be modeled as asymptotic sequences of independent and 
identically distributed (iid) subprocesses. 

In this section we provide the definitions of the underlying entropy measures, called 
smooth min- and max entropy, and state some of their basic properties. Further prop- 
erties are summarized in Appendix A. For a more detailed discussion of the smooth en- 
tropy formalism we refer to [Ren05, KRS09, TCR09, TCR10, Dat09]. 

Recall the following standard definitions. The von Neumann entropy of p e S = {%) is 
defined as 5 H(p) = — Tr(plog p) and the conditional von Neumann entropy of A given 
B for PAB £ S={U) is defined as H(A\B) p = H{AB) p - H(B) p . 

Definition 2.2. Let pab e <S<z(T-Lab)- The min-entropy of A conditioned on B is defined as 

H mhl (A\B) p = sup sup{A e M : 2~ A 1 A ® a B - pab > 0} . 

a B eS=(H B ) 

The max-entropy of A conditioned on B is defined as 

H max (A\B) p = sup log F(pab, 1a ® <tb) 2 • 

a B eS = (H B ) 

In the special case where B is trivial (i.e., one-dimensional), we write H m - m (A) p and 
H max (A) p instead of H m - m (A\B) p and H max (A\B) p , respectively. It can be shown that 

H min (A) p = - log || pa || oo and H mSLX (A) p = 21ogTr v / pX 

The smooth min- and max-entropy are defined by extremizing the non-smooth versions 
over a set of nearby states, where nearby is quantified by the purified distance. 

Definition 2.3. Let p,a e S<^ (H). The purified distance between p and a is defined as 

P(p,a) = *Jl-F(p,a)i , 
where F(p,a) = F(p,a) + — Tr p) (1 — Tr a) denotes the generalized fidelity. 

The purified distance is a distance measure on S<^{1-L) [TCR10, Lemma 5]. As its name 
indicates, P(p, a) corresponds to the minimum trace distance between purifications of p 
and a. 

Henceforth p, a e S^(H) are called e-close if P(p, a) ^ e and this is denoted by p ^ £ a. 
We use the purified distance to specify an e-ball around p e S<^ {%): 

B%p) = {p'eS^n):p'^ £ p}. 

For more about the purified distance we refer to [TCR10]. 

Definition 2.4. Let e > and pab £ S^Hab)- The e-smooth min-entropy of A conditioned on 
B is defined as 

H £ mill (A\B) p = sup H min {A\B) p . 

Pab£B £ (pab) 

The e-smooth max-entropy of A conditioned on B is defined as 

H* max (A\B) p = _ inf H max (A\B) p . 

Pab£o e (Pab) 

5 All logarithms are taken to base 2. 
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The min- and max-entropy are dual to each other in the following sense. 

Lemma 2.5. [TCR10, Lemma 16] Let e > 0, pab £ S^{%ab) and Pabc e S<^(Habc) an 
arbitrary purification of pab- Then 

Smooth entropies satisfy various natural properties analogous to those known for the 
von Neumann entropy One of the most important ones is the data processing inequality. 

Lemma 2.6. [TCR10, Theorem 18] Let e > 0, p A B e S^(Uab), and let Tb^c be a TPCPM 
from B to C. Then 

H min( A \ B )p < H Lin( A \ C )T(p) 
H La, x ( A \ B )p *S #max ( A I C )t( P ) • 

Smooth entropies are generalizations of the von Neumann entropy, in the sense that the 
von Neumann entropy can be retrieved as a special case via the Quantum Asymptotic 
Equipartition Property (AEP). 

Lemma 2.7. [Toml2, Corollary 6.6 and 6.7] Let < e < 1 and pab £ S=(Hab)- Then 

lim -H £ min (A\B) p&n = H(A\B) p 

n— >oo n 

\\m -H^ x (A\B) p ^ = H(A\B) p . 

n— >oo n 

For more properties of smooth entropies we refer to the Appendix A and [Ren05, KRS09, 
TCR09, TCR10, Dat09]. 

For technical reasons we will also need the following auxiliary quantities. 

Definition 2.8. Let pab £ S^(Hab)- The quantum collision entropy of A given B is defined as 

2 



H 2 {A\B) p = sup -logTr 

a B eS = (H B ) 



1a ® cf b 1/a )pab{1a ® ° B 1/4: , 



Definition 2.9. Let pab e S^(Hab) and as £ S^(Hb)- We define 

)p\ 



H m3uX (A\B) pla := log F(pab, 1a ® <tb) 2 - 



Note that i? max (^|5) p = sup ct6<Ss;(Wb) H mBX (A\B) p]a . 
Definition 2.10. Let pab £ S^(Uab) and as e S^(Hb)- We define 

H miQ {A\B) pW : = sup{A e M : 2~ A 1 A <g> ag - > 0}. 

Note that F min (yl|B) p = sup^^^) H min (A\B) p \ a . 

Finally we note that, since all Hilbert spaces in this paper are assumed to have finite 
dimension, the infima and suprema in the expressions above can be replaced by minima 
and maxima, respectively. 
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3 Achievability 



In this section, we present and prove a general decoupling theorem (Theorem 3.1), which 
corresponds to the achievability part of the criterion sketched informally in Section 1. 
The theorem subsumes and extends most previous results in this direction. 

3.1 Statement of the Decoupling Theorem 

As explained in the introductory section (see Fig. 1), we consider a mapping from a sys- 
tem A to a system B. The mapping consists of a unitary on A, selected randomly accord- 
ing to the Haar measure over the unitary group on tLa? followed by an arbitrary mapping 
T = Ta-^b- hi applications, T often consists of a measurement or a partial trace (see Ta- 
ble 2 for examples). The decoupling theorem then tells us how well the output, B, of the 
mapping T is decoupled (on average over the choices of the unitary) from a reference 
system E. 

Theorem 3.1 (Decoupling theorem). Let e > 0, pae £ S = {T~Lae), and Ta-^b « CPM with 
Choi-Jamiolkowski representation tab = J{T) such that Ty{jab) 1- Then 

[ \\T{U PAE tf) - T B ® PfiL dtf < 2-^ H ^\E) P -\H^{A\B) T + 12£ ( (1) 
JV(A) 

where \ -dU denotes the integral over the Haar measure over the full unitary group on Ha- 

The theorem thus provides a bound on the quality of decoupling that only depends on 
two entropic quantities, H^ ain (A\E) p and H^ ain (A\B) T . The first is a measure for the cor- 
relations between A and E that are present in the initial state, pae- The second quantifies 
properties of the mapping T, which is characterized by the bipartite state tab obtained 
via the Choi-Jamiolkowski isomorphism J. Hence, in order to minimize the right hand 
side of (1), no channel ends up being better suited for some types of states than for oth- 
ers or vice-versa. Furthermore, as discussed in Section 4, the bound in (1) is essentially 
optimal in many cases of interest. We also note that, using Markov's inequality, the expec- 
tation value over the unitaries U can be turned into a bound that holds for most unitaries. 
That is, for any p > 0, 

InUpAsrf) -T B <S>PeI< -2-^(A\E) P -h^ mi M\B), + 12 £ 

111 p p 

holds with probability at least 1 — p (for U chosen according to the Haar measure). 

Our first step in proving Theorem 3.1 is to prove a version involving non-smooth min- 
entropies (Theorem 3.2). Then, in a second step, we show that smoothing preserves the 
essence of the theorem. Note that Theorem 3.2 may be of interest in cases where no 
smoothing is required since it is slightly more general: it applies to any completely posi- 
tive T, not only trace-non-increasing ones. 

Theorem 3.2 (Non-smooth decoupling theorem). Let pae £ S<^{1-Lae) and Ta-^b a CPM 

with Choi-Jamiotkoivski representation tab = J(T). Then 

f \\T{Up AE tf) -T B ® p E \ dU ^ 2-\ H ^ A \ E ^ H ^ A \ B ^ , 
where J -dU denotes the integral over the Haar measure over unitaries U acting on A. 



8 



3.2 Technical Ingredients to the Proof 



The proof is based on a few technical lemmas, which we state and prove in the following, 
and which may be of independent interest. We note that they partly generalize tech- 
niques developed in the context of privacy amplification [RK05, Ren05, TRSS10] as well 
as earlier work on decoupling (see, e.g., [HOW07]). 

Lemma 3.3 (Swap trick). Let M,N e C{H A )- Then Tr[(M <g> N)F] = TV [MTV], where F 
swaps the two copies of the A subsystem. 

Proof. Write M and TV in the standard basis for Ha- M = Xiij m «iKXil andiV = 2w ra wl^X^I- 
Then 

^] mijn k i\i}(l\ <g> \k)(j\ 

ijkl 

= ^m ijnji = Tr[MN] . 

ij 

□ 



Tr[(M <g> N)F] = Tr 



2 m i:j n k i\i)(j\ ® \k)Q\ ]F 



Tr 



.ijkl 



The second lemma involves averaging over Haar distributed unitaries. While it would 
take us too far afield to formally introduce the Haar measure, it can simply be thought 
of as the uniform probability distribution over the set of all unitaries on a Hilbert space. 
The following then tells us the expected value of U® 2 M(tf)® 2 with M e C(Hf 2 ) when 
U is selected "uniformly at random". 

Lemma 3.4. Let M e £(nf 2 ). Then 

E(M) := f U® 2 M{U^)® 2 dU = al A A> + @F A , 
JV(A) 

where Fa swaps the two copies of the A subsystem, a and (3 are such that Tr[M] = a\A\ 2 + (3\A\ 
and Tr[MF] = a\A\ + (3\A\ 2 , and dll is the normalized Haar measure on U(A). 



Proof. This follows directly from a standard result in Schur-Weyl duality, e.g., Proposi- 
tion 2.2 in [CS06]. The latter states that E : £(Hf 2 ) — C(nf 2 ) is an orthogonal pro- 
jection onto span{l,F} under the inner product (A, B} = Tr[A*B]. Hence, E(M) can 
be written as al A A> + P^A as claimed, and the conditions Tr[lE(M)] = Tr[M] and 
Tr[FE(M)] = Tr[FM] must be fulfilled, and these lead to the two conditions on a and 
(3. □ 



The following bounds the ratio of the purity of a bipartite state and the purity of the 
reduced state on one subsystem: 

Lemma 3.5. Let £ A b e V(Uab)- Then 

J_ < ^ [Ub 2 ] 

\A\ - Tr[t B 2 ] - lAl - 
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Proof. Letting A' be a system isomorphic to A, we first prove the left-hand side 

Tr [£b 2 ] = Tr [Tr A [Ub? 

= Tr [Tv A [C AB ] Tr A > [£a'b]] 
= Tr [Ub (Tr A , [£ A , B ] <8 1a)] 
= Tc[(^ B ®l A /)(eA'B®lA)] 

< VT> [(Cab ® IaO 2 ] Tr [{Wb ® U) 2 ] 
= Tr [Cab 2 ® 1a'] 
= |A| Tr [^b 2 ] , 

where the inequality is due to an application of Cauchy-Schwarz. The right-hand side 
follows from the fact that £ab ^ \A\ ■ 1a ® £b- This can in turn be seen from the fact that 
\A\ ■ 1 A <g> £ B = Xiili U A £, AB {U A )\ where the U A 's are Weyl operators with U\ = 1 A . □ 

In the main proof, we will need to bound the trace distance between two states. The 
following lemma will allow us to do this: 

Lemma 3.6. Let M e C{U A ) and a e V{H A ). Then 



||M||i < <v/Tr[>] Tr[a- 1 /4MCT- 1 /2Mta- 1 /4] . 
In particular, if M is Hermitian then 



||M||i s$ yTr[a]Tr[(a- 1 /4Mcj- 1 /4)2] . 

This is a slight generalization of Lemma 5.1.3 in [Ren05]; we give a different proof here 
for completeness. 



Proof. 

|| M||i = max|Tr[£/M]| 



max 

U 



< maxyTr[(a 1 /4J7 (T i/4)( (7 i/4 [/ t (7 1 /4)]Tr [a'^Ma-^M^a- 1 / 4 ] 
max TrfcrVs [7^1/2^] Tr [a-V^Mo-VaM^-i/*] 



Tr[cr] Tr [a-V^Ma-WMl a' 1 / 4 ] , 



where the inequality results from an application of Cauchy-Schwarz, and the maximiza- 
tions are over all unitaries on A. The last equality follows from 



maxTrfcj^f/cr 1 / 2 ^] max yTr[a] Tr^^WU^W] 
= Tr[a] 

< maxTrrcr 1/2 C/a 1/2 [/ t l . 

U 



□ 
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3.3 Proof of the Non-Smooth Decoupling Theorem (Theorem 3.2) 



Throughout the proof, we will denote with a prime the "twin" subsystems used when 
we take tensor copies of operators, and F$ denotes a swap between S and S'. 

We first use Lemma 3.6; for a E £ S={Hb) and Ce e S = (H E ) we get 

T~B ® PEL 



<a/Tt 



Now define the CPM 7a-».b(-) = CJ B 1//4 7A^B(-) CJ B 1/ ' i an d the states t^'b = <^(7") and 
Pae = Ce^P ae ^e^- ^ e then rewrite the above as 



-1/4 



\\T(Up AE tf)-TB®PE\\ 1 < yTr 
Using Jensen's inequality we obtain 

We now simplify the integral 



f(Up AE W) -f B ®p E 



t(Up AE W) - TB® PE 



dU . (2) 



I 



Tr 
Tr 
Tr 
Tr 



r{up AE u^)-f B ®pE 

2 



dU 



T(Up 



AE 



T(Up AE tf) 



dU 

dU - 2 Tr 



(fB ® PE 

T^UpaeUUu) [f B ®PE 



dU + Tr 



[TB ® PE) 



Tr 



[TB ® PE) 



t(Up 



AE 



dU-Tr[f B ]Tr[p 2 E ]. 



We rewrite the first term as follows 

2 



I 



Tr 



t(Up AE tf) 



dU 



|Tr (t{U~p AE 

Jlr [(t® 2 (C/® 2 pf|(^)® 2 )) F BE 
|Tr ({([/t)® 2 (f^)® 2 (F B )[/® 2 } (8 F £ 
PaI (j{{tfrW)® 2 {F B )U® 2 ) dU ® F E 



(3) 



where we have used Lemma 3.3 in the first equality, and the definition of the adjoint of a 
superoperator in the third equality. We now compute the integral using Lemma 3.4 



j{tf)® 2 {P) m {F B )U m dU = al AA > + (3F A 
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where a and (3 satisfy the following equations 



a\A\ 2 + p\A\ = Tr 



(P) m {F B ) 



Tr 



F B {t) m {lAA>)} = \A\ 2 T,[F B ff] 



\A?^[f B ] 



and 



a\A\ +P\A\ 



Tr 



(■P)®\F B )F A 



Tr 



F B T® 2 {F A ) 



= \A\ 2 Tr [F B Tr AA , [ff |(J^ <g> 1 BB ,)]] 
= \A\ 2 Tr [(1^, is F B )f||(F A ® 1 BB ,)] 
= \A\ 2 Tr[F AB ff 2 ] = \A\ 2 Tr[f 2 AB ] . 

In the third equality, we have used the fact that tab is a Choi-Jamiolkowski representation 
of T (Lemma 2.1); the fourth equality is due to the fact that the adjoint of the partial trace 
is tensoring with the identity. 



Solving this system of equations yields 



a 



1 |A| 



1 



f3 = Tr[f 2 AB ] 



I A \2 \A\Tr[f%] 



V 



141 



1 



By applying Lemma 3.5, we can simplify this to a < Tr [f|] and (3 < Tr [r^ B ] . Substitut- 
ing this into (3) and using Lemma 3.3 twice, and then substituting into (2) yields 

J lT(U PAE rf) -r B ® pE^dU ^Tr[f 2 AB ]Ti[p 2 AE ] . 



Finally we get the theorem by using the definitions of tab, Pae and the definition of Hi 
(Definition 2.8). 

□ 



3.4 Proof of the Main Decoupling Theorem (Theorem 3.1) 

We now prove our main result, which is obtained from the non-smooth decoupling the- 
orem (Theorem 3.2) by replacing the collision entropies, H2, by smooth min-entropies. 

First, note that H2 is always greater or equal to H mm (Lemma A.l) and therefore we are 
allowed to replace the H2 terms on the right-hand side of the statement of Theorem 3.2 
by H mm terms. Thus we only have to consider the smoothing. 

Let p AE e B%pae) be such that H £ min (A\E) p = H min (A\E) p and tab e B%tab) be such 
that H £ min (A\B) T = H min (A\B) f . 
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Furthermore write ? — r = A + — A_, where A+ e V(Hab) have orthogonal support, and 
likewise, p — p = 5 + — <5_ with 5 + and <5_ having orthogonal support. By Lemma B.l we 
have ||? — t||i < 2s and hence IjA+j^ ^ 2e. 

Moreover define T, T>- and V + as the unique superoperators that are such that ? = J(T), 
A_ = J{V-) and A + = J{V+) respectively. 

Using the non-smooth decoupling theorem (Theorem 3.2) we get 



f f{UpA E U^)-?B®pE 
Ju(A) 

f f(Up AE rf) - t b ® PE dU-Ae. 

JU(A) 1 

f \\T(U P AEtf)-TB® P ELdU- f f(Up AE tf)-f(Up AE tf) 
Ju(A) JU(A) 

- f T(U PAE tf) - f{U PAE tf) 

JU(A) 



dU 



dU - 4e . 



We now deal with the second term above 

f 

W(A) 



d?7 



f f{U PAE tf) - f{Up AE tf) dU = f f - <L)E/t) 

Ju(A) J 

J ||f (Etf+t/^l x dZ7 + J ||f (Ett_E/t) 
|Tr[f (CM+t/t)]^ + J Tr[T(i/5_ i7 t )]di7 
(Tr[<5 + ] + Tr[<5_]) 



sS 4eTr[?] 

We deal with the third term in a similar fashion 



f T(U PAE tf)-f(Up AE tf) dU 

JU(A) 1 

= j \\(V + -V_)(Up AE U%dU 

< J I P+ (U PAE Ul) \\ 1 dU + ^ ||X>_ {U PA eU^) I ! d?7 



Tr 



Tr 



= Tr [A+ (8) p B ] + Tr [A. 



This results in 

f 

W(A) 



LI IS 11 L 

Jv(A) 



□ 
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4 Converse 



The main purpose of this section is to state and prove a theorem (Theorem 4.1) which 
implies that the achievability result of the previous section (Theorem 3.1) is essentially 
optimal for many natural choices of the mapping T. More precisely note that, according 
to Theorem 3.1, decoupling is achieved whenever the exponent H^ ain (A\E) p +H^ ain (A\B) T 
is sufficiently larger than 0. Our converse now says that this is also a necessary condition 
(up to additive terms of the order log 1/e) if one replaces the min-entropy in the second 
term, H^ aia (A\B) T (which characterizes the channel), by a max-entropy, H^ aax {A\B) T . 

The two terms, H^ ain (A\B) T and H^ smx (A\B) T , coincide for many standard channels used 
for applications (e.g., for state merging, cf. Section 5). Examples of such channels are 
given in Table 2. Furthermore, as we shall explain in the discussion section, the two 
terms also coincide asymptotically for iid channels. 

Theorem 4.1. Let p AE e S = (U AE ), Ta^b a TPCPM, and t ab = \A\ (^pX) J J{T) (^pl) 1 ■ 
Suppose that 

\\T(pae) - T{p A ) ® Hi < e . 

Then, for any e',e" > 0, 

Hlir^ +2V?+£ '\A\E) p + B£j,A\B) T > -log i . 

Note that in the above, tab can be produced by taking a purification ( AA / of pa where 
the system A' is a copy of A, and sending the A' part through the channel. 

Proof. Let paer be a purification of pae and Uj^ BB , a Stinespring dilation of T. Further- 
more, let 

\o)bb'er '■= (Ua^bb' ® ^er)\p)aer , 

and \(t)bb'er be a subnormalized state such that H m3iX (ER\B) a = H^ ax (A\B) T with 
P{a,a) s: e" , as well as \o) B b>er such that a BE = <?b ® °e, and F(a BB r ER ,a BB , ER ) = 
F(a BE , gb®ge) — such a state exists by Uhlmann's theorem, and can be shown to satisfy 
P(a, a) < y/Qe" + 2e. The latter bound can be obtained from 

\\&BE — &Be\\i ^ \\&BE — &Be\\i + W&BE ~ &BE\\l 

^ \\cr B E — &be\\i + \\&be — &B <8> (Je\\i + W&B ® &E — cr B <g> a E || i 
sS e" + e + \\a B <8> a E - a B <8> cr E \\i + \\a B <g> a E - a B ® a E \\i 
^ 3e" + e , 

combined with Lemma B.l. Now, we know from Lemma A.8 that 

O-BB'ER < 2 H ^ ER \ B U°Y EBR <g> l s , , 

where 

Y BER := 2-^-(^«| B ) CT|CT(T -Vy (T V2 (7BER(7 V2 (r -l/2 
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This implies that 

obb'er < \ • 2 H ™^ ER \ B U° ((1 - e')a B 1/2 a BER a B 1/2 + e'Y BER ) ® t B , (4) 
for any e' > 0. Tracing out the R system, we get 

<?beb> < ^7 • 2 Hm "< B *l fl >*i* ((1 - £ ')1b ® + e'Ves) ® Is' • 

We now define Gbb := V / 1^£ 7 <4 /2 ((1 - e')ls ® ob + e'yijij)- 1 / 2 . Note that G is a 
contraction (i.e. ||G||oo < 1): 

GG^ = (1 - £')4 /2 ((! " ^')lfl ® ^ + 4 2 
< (1 - e')af ((1 - £ ')lfl ® 4 /2 

= 1b£, 

where we have used the operator monotonicity of /(i) = — 1/t. At this point, we conju- 
gate both sides of (4) by Gbe to get 

G B e°beb<G be < ■ 2 H ^ ER \ B U^ E ® Ibb' (5) 

^ . 2 H ^( ER \ B )^a E ® Ibb' ■ (6) 

Let us now define |V , )bebb / := Gbe\c}berb'} note that since G is a contraction, |^>) is 
subnormalized. Then, we can rewrite (6) as: 

i>BEB> < ^ ■ 2 H ™^ ER \ B U°G E ® IflB' , 

e 

which implies 

H min (BB'\E)^ G > -H max (ER\B) al(7 - log(l/ £ ') . 

We will now need to show that ipBEB' is (2V6e" + 2e + 2\fe i + e") -close to obeb 1 , because 
this implies the claim 

H 2 m ir^ +2V?+£ "(A\E) p + H^{A\B) T > - log(l/,') . 
To this end, we shall define the following states 



W)berb' 
\ip }berb' 

\^)BERB' 



G be \cf)berb' 
Gbe\&)berb' 



VT — s'G be \o)berb' ■ 

We first show that all these states are (sub)-normalized such that the purified distance 
between them is well-defined. Since Gbe is a contraction, we immediately get that 
^ 1 and || < 1. Furthermore, 

= (l-e , )(a\G BE t G BE \a) 

= (a\a E 1/2 ((1 - e')t B ® a E + e'Y BE ) a E 1/2 \a) 
= 1 - e' + e'(a\a E 1/2 YB E (T E 1/2 \a) 



1 - £ + e' Tr 



V -1/2 _ -1/2 
*BE<? E &EB& E 



l-e' + e'Tr [Y BE a B ] 
1 . 
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We have (4>\ip') = Vl - s', and 

(a\& = VT^(*\G BE \a) 



= Tr 
= Tr 
> Tr 



(<r B ® ((1 " e')l B ®<?E + e'Y BE ) 1/2 a E 1/2 
{as ® 4 A ) ((! " ® ^fi + e'Y BE f 12 



1/2 \ 



= Vl - e' Tr [cr B <g> cr^] 

where the inequality is due to the operator monotonicity of the square-root function. 
Therefore P(i/}', <r) < 2%/? and furthermore P(ip" ,a) = P(ip' ,a), since 

F^",a)={a\G BE \a) = F{a^'). 

Since conjugation by G is trace-non-increasing, we also have 



P«, t/0 < P((7, a) < V6e" + 2e . 

This implies 

P(V>, 5) < P(iJ>, O + PW,d-) + P{°, <r) + P((r, 5-) < VGe" + 2e + 2%/i 7 + V6e" + 2e + e" . 

□ 



5 One-Shot State Merging 



As an example application of the decoupling theorem and its converse we discuss One- 
Shot Quantum State Merging. This is a two-party task: its goal is to transfer the information 
contained in a quantum system, A, initially held by one party, Alice, to the other party, 
Bob. This should be achieved with only limited resources (such as entanglement or com- 
munication). It is taken into account that Bob may have access to a quantum system, B, 
correlated to A, which may be used to minimize the use of resources. The term one-shot 
is used to emphasize that the task is considered in the general one-shot scenario. As ex- 
plained in the discussion section, the asymptotic iid results, where many independent 
copies of a given state are transferred, can be recovered as a special case. 

The notion of quantum state merging has been introduced in [HOW05, HOW07] and a 
protocol has been proposed that achieves the task in the asymptotic iid scenario. The 
more general one-shot setup we consider here was first analyzed in [Ber08] and prelimi- 
nary results appeared in [KRS09]. 

We start giving a formal definition of quantum state merging [HOW05, HOW07, Ber08]. 
Let pab be the joint initial state of Alice and Bob's systems. We can view this state as 
part of a larger pure state pab e that includes a reference system E. In this picture state 
merging means that Alice can send the ^4-part of pabe to Bob's side without altering 
the joint state. We consider the particular setting proposed in [HOW05] where classical 
communication from Alice to Bob is free, but no quantum communication is possible. 
Furthermore, Alice and Bob have access to a source of entanglement and their goal is to 
minimize the number of entangled bits consumed during the protocol (or maximize the 
number of entangled bits that can be generated). 
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Definition 5.1 (Quantum State Merging). Let pab £ S = {%ab), and let AqBq be additional 
systems. A TPCPM 8 : AA <8> BB A x ® B\B'B is called Quantum State Merging of p AB 
with error e ^ 0, if it is a local operation and classical forward communication process for the 
bipartition AAq — ► A\ vs. BBq —> B\B'B, and 

(£ <g) l E ){$A oBo ® 9 ABE) *s &A lBl ® PBB'S , 

a>/zere pbb'e = {^a^b 1 ®3-be)pabe for a purification pabe of pab, and <& K , $ L are maximally 
entangled states on AqBq, A\B\ of Schmidt-rank K and L, respectively. The number 

I s : = logiC- logL 

is called entanglement cost. 6 

We are interested in quantifying the minimal entanglement cost for Quantum State Merg- 
ing of pab with error e. For this, we use the achievability and converse for decoupling 
(Theorem 3.1 and Theorem 4.1). These allow us to derive essentially tight (up to additive 
terms of the order log 1/e) bounds on the entanglement cost. 

The basic idea underlying our analysis of Quantum State Merging is the observation that 
the desired situation after the protocol execution is necessarily such that Alice's system is 
decoupled from the reference. Furthermore, it follows from Uhlmann's theorem [Uhl76] 
that this decoupling is also sufficient. 

Theorem 5.2 (Achievability for Quantum State Merging). The minimal entanglement cost 
for Quantum State Merging of pab £ S=(7~Lab) with error e > is upper bounded by 

/ e ^ Hg£ 3 (A\B) p + 41og(l/e) + 21og 13 . 

Proof. Let pabe be a purification of pab- The intuition is as follows. In the first step 
of the protocol, Alice decouples her part from the reference (employing Theorem 3.1), 
where she chooses a rank-L projective measurement as the TPCPM, and she sends the 
measurement result to Bob. For all measurement outcomes the post-measurement state 
on Alice's side is then approximately given by ® pe and Bob holds a purification of 

this. But <g> pe is the reduced state of &\ 1 b 1 ® Pbb'e as well and since all purifications 
are equal up to local isometries, there exists an isometry on Bob's side that transform the 
state into &a 1 b 1 ® Pbb'e (by Uhlmann's theorem [Uhl76]); this is then the second step of 
the protocol. 

More formally, choose K and L such that 

logiC - logL = Hg£?{A\B) p + 41og(l/e) + 21og 13 , (7) 

which is the entanglement cost of the protocol. 7 

6 In the original references [HOW05, HOW07] quantum state merging was defined slightly differently, 
namely as a local operation and classical two-way communication process. However, their protocol for the 
achievability only uses classical forward communication. 

7 Since we need K, LeM, we can not choose logic" — logL exactly equal to H^l^ 3 (A\B) P + 41og(l/e) + 
2 log 13 in general. Rather, we need to choose K, L e N such log K — log L is minimal but still greater or 

equal then H^ 3 {A\B) P + 4 log (1/e) + 2 log 13. 
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Choose TV fixed orthogonal subspaces of dimension L on AAq, 8 denote the projectors on 
these subspaces followed by a fixed unitary mapping it to A\ by P% a-^Ai an< ^ define the 
isometry 

WaoA^A^aXb ■= J] PloA^A! ® \ x )x A ® \x)x B ■ (8) 

X 

Denote by Ua a a unitary selected randomly according to the Haar measure over the 
unitary group on Ha a and write 

Oa B ABE ■= ®A B ® PABE 

caqBoAbe ■= Ua a6a b abeU AoA . 

Now the first step of the protocol is to apply this unitary followed by the isometry (8), and 
to send the Xb system to Bob. In order to take into account that the channel is classical, 
we keep a copy Xa at Alice's side. 

By the decoupling theorem (Theorem 3.1) we get for 

<? AxXaXbBqBE = (Wa A->A 1 X a X b ) c7 AoB ABe(Wa A->A 1 X a X b )^ ■ 

that 

\Wmx a e - r Al x A 9pb\x* 2"V^<* WaM + 12 . g2 (9) 

where A' A' is a copy of AqA, and 

l T >A; ) A'A 1 X A X s : = WaoA^A^aXb^A'qA'AqA 

with 

\®)a> a>a a ■= kT\A\^^ a '° a ' ® ^ A ° A ' 

We can simplify this using the super additivity of smooth min-entropy (Lemma A. 2) and 
the duality between min- and max-entropy (Lemma 2.5) 

H e 2l 3 (A A\E) e > H £ 2l\A\E) p + log K = -Hg£*(A\B) p + logK . (10) 

Furthermore, because ta' q a'AxX a * s classical on Xa, we can use a lemma about the min- 
entropy of classical-quantum states (Lemma A.5) and get 

^minVo^lXOr > H min (A' Q A' | A X X A ) T = " log(J> a " 2" 2f » i »^ A 'l Al ^) 

x 

^ mm H min (A' Q A' \ A!) T . , 

X 

where 

T ALA'A\ := —^={^A' A' ® P AoA -^ Ai )\$}a' A'A A 
y Px 

Px ■= \(t A > A> ® ^3oA-Ai)l*>A' A'A)Al • 



8 For simplicity assume that K ■ \A\ is divisible by L. In general one has to choose N — 1 fixed orthogonal 
subspaces of dimension L and one of dimension U = K ■ \ A\ — (N — 1) • L < L. The proof remains the same, 
although some coefficients change. 
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But since P AqA —, a , ^ s a ran k L projector, we can use a dimension lower bound of the 
min-entropy (Lemma A.3) to conclude that for all x 

H^A'qA^Ax)^ > -logL . 

This together with (7), (9) and (10) implies 



l^ll 



PB 



12 



^ 2 -l/2(log if -log L-if^ : (A|B) P ) + _ . £ 2 



13 



12 



_ 9 -l/2(41og(l/e)+21ogl3) . ±f . -2 _ 2 

~ 2 + 13 £ " £ ' 

and hence -^((tajXaE, npn- ® t Xa ® ps) 5* 1 - ^e 2 (by Lemma B.l). 

In the second step of the protocol, Bob decodes the system to the state pbb'e® 3%Bi ■ A 
suitable decoder can be shown to exist using Uhlmann's theorem [Uhl76]. There exists 
an isometry V B b x b -^bb / b 1 x b such that for 



VA 



1 X A X B BB'B 1 E '■= {VbBuXb^BB'BiXb) A 1 X a X b BB e{VbBoX b ^BB'B 1 Xb)^ 



F{ct AiXa E, 7-T7 ® T Xa ® p#) = F{r) Al x A X B BB'Bi.E,TX A X B ® ®AiBi ® PBB'E) , 

I -All 

and with that 

F(vaiX a x b bb'BiE, tx a x b ® ® Pbb'b) > 1 - ^e 2 • (11) 

Expressing this in the purified distance (with Lemma B.l) and discarding X a Xb, we 
obtain a e-error Quantum State Merging protocol for p A BE- □ 

Theorem 5.3 (Converse for Quantum State Merging). The minimal entanglement cost for 
Quantum State Merging of p AB e S = (T~L A b) with error e > is lower bounded by 

l £ >H^ 3 %A\B) p -21ogl. 



Proof. We start with noting that any e-error Quantum State Merging protocol for p A B 
can be assumed to have the following form: applying local operations at Alice's side, 
then sending a classical register from Alice to Bob, and finally applying local opera- 
tions at Bob's side. For a purified state p A BE, the protocol produces a state e-close to 

*AiBi ® PBB'E- 

As can be seen from the definition, it is a necessary step for any Quantum State Merging 
protocol to decouple Alice's part from the reference. The idea of the proof is to use the 
converse for decoupling (Theorem 4.1). This then results in the desired converse for 
Quantum State Merging. 

More precisely, a general e-error Quantum State Merging protocol for p A s e has the fol- 
lowing form. At first some TPCPM 

T AoA ^ Al x B (-) =Y i M A Q A^A 1 (-) ® \x){x\x B 
x 
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is applied to the input state &a o b ® Pabe- By the Stinespring dilation [Sti55] we can 
think of this TPCPM as an isometry 

WaoA^AgXbXa = 2 M MA^A 1 A G ® l x >*4 ® k>X s , (12) 

where the A^I 0j 4^AiA g are partial isometries and Aq,Xa are additional 'garbage' reg- 
isters on Alice's side that will be discarded in the end. The isometry W results in the 
state 

^A-lAgXaXbEBoE ■= ^ \l X )A 1 A G BB E ® \x)x A ® |^>X S , 
a; 

with 

|7 s >A 1 A o sb b := M1 oA ^ iAg (|$ jc > AoBo S |p>abe) • 

The next step of the protocol is then to send the classical register Xb to Bob. 

Now let us analyze how the state "fA 1 A G x A E has to look like. By the definition of Quantum 
State Merging (Definition 5.1) the state at the end of the protocol has to be e-close to 
^AiBi ® Pbb> e- This implies that Alice's part A\ has to be decoupled from the reference. 
But because the state <& I a 1 b 1 ® Pbb'E is pure this also implies that all additional registers, 
that we might have at the end of the protocol, have to be decoupled as well. Thus we 
need 

7Aj A G X A E TXT ® 1A G X A ® P£ , (13) 

I "All 

and in trace distance (using Lemma B.l) this reads 

TaMgXaE - T-r-f ® 7A G x A ® Pe ^ 2e . (14) 
l>n| i 

Using the converse for decoupling (Theorem 4.1) for the isometry Wa a^AxA g x b x a in (12) 
followed by the partial trace over Xb, we get that the decoupling condition (14) implies 
for any e', e" > that 

H 2 J^ +2V ^ +E "(AoA\E) p + H^M*\MA G X A ) T > - log i , 

where 

^A'AMgXa := trx s (1a' A' ® ^AoA^AiA g X b xJCaJ,A'A a(1a' A' ® WaoA^AiAg^xJ 

f° r Ca^A'AoA a purification of ® ^a with A' A' a copy of AqA. As a next step we 
simplify this in order to bring the converse into the desired form. 

Choosing e' = e 2 and e" = e, using a dimension upper bound for smooth min-entropy 
(Lemma A.4), and the duality of min- and max-entropy (Lemma 2.5) we obtain 

log if + H^A'^A'^AgXa), >H^ +3 %A\B) p - 21og \ . 
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By the decoupling criterion in purified distance (Equation (13)), the state ta> a'A 1 a g x a 
has to be e-close to a state 

Ca' A'A 1 A g X a =Yi qx ^A' A'A 1 A G ® \ x X x \x A , 
x 

where q x is some probability distribution and A / AlAc pure with £, AlAa = jjt\ ® £1 G f° r 
all x. Hence 

H^A'qA'^AgXa), sc H^A'qA'IAUgXa^ 
and by a lemma about the max-entropy of classical quantum states (Lemma A.6) 

H^A'oA'IA.AgXa^ = log \J^q x ■ 2 H ^ A 'o A '\ A ^o^ 

Using the duality of min- and max-entropy (Lemma 2.5) and a polar decomposition of 

£a' A'AiA g ' we § et 

H max (A A \AiAq)^ = —H m i n (A' A)^ = — H m i n (AiAc)^ 

= —H m i n (Ai) — H min (AG)^ ~H min (Ai) = — logL . 

Hence the converse becomes 

log K - log L ^ H^ +3s (A\B) p - 2 log ± . 

□ 



6 Discussion 

The main contribution of this work is a decoupling theorem, i.e., a sufficient (Theo- 
rem 3.1) and necessary (Theorem 4.1) criterion for decoupling in terms of smooth en- 
tropies. The fact that the criterion is nearly optimal for various choices of the decoupling 
map T suggests that use of smooth entropies is natural in this context. 

A crucial property of our decoupling theorem is that it is valid in a one-shot scenario, where 
the decoupling map T may only be applied once (or, by replacing T by T® k ', any finite 
number of times). This contrasts with (and is strictly more general than) the iid scenario 9 
usually considered in information theory, where results are stated and proved asymp- 
totically under the assumption that the underlying processes (such as channel uses) are 
repeated many times independently. The generalization to the one-shot scenario is par- 
ticularly relevant in the context of applications in physics (e.g., the study of black hole 
radiation as considered in [HP07, BP07, BZ09] or the analysis of thermodynamic sys- 
tems [dRAR + ll]), where the channel T is supposed to model the evolution of a single 
system. 

We note that asymptotic iid statements can be easily retrieved from the general one- 
shot results using the Quantum Asymptotic Equipartition Property (AEP) for smooth 

9 The abbreviation iid stands for independent and identically distributed. 
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entropies [Ren05, TCR09] (see Lemma 2.7). For this, the decoupling map T as well as 
the initial state pae need to be replaced by many identical copies of themselves, i.e., T® n 
and /5®£. The achievability bound of Theorem 3.1, i.e., the condition that is sufficient for 
decoupling, then turns into the criterion 

H(A\E) p + H(A\B) T >0, (15) 

where H denotes the (conditional) von Neumann entropy. Analogously, the converse, 
i.e., the condition which is necessary for decoupling, turns into 

H(A\E) p + H(A\B) T ^0, (16) 

In other words, in the iid scenario, the achievability bound (15) and the converse bound (16), 
taken together, imply an exact characterization of decoupling. 

The decoupling theorem, in its general form stated in Section 3, has various applications. 
As illustrated in Section 5, these are often possible because of a duality between indepen- 
dence and maximum entanglement: given a pure state pber such that ps is maximally 
mixed, the property that the subsystem B is independent of E and the property that B is 
fully entangled with R are equivalent. 

Information-theoretic applications other than state merging (cf. Section 5) have been in- 
vestigated in [Dup09]. One of them is channel coding. Here, Alice wants to use a noisy 
quantum channel N A ^ B to send qubits to Bob with fidelity at least 1 — e. The idea is 
that decoding is possible whenever a purification of the qubits Alice is sending is de- 
coupled from the channel environment. One can therefore get a coding theorem directly 
from Theorem 3.1 by setting T to be the complementary channel of M (i.e. consider a 
Stinespring dilation Uf^ BE of TV, and set T A ^ E {-) := Tr B [U • J7 1 "]). Unassisted channel 
coding [Llo97, Sho02, Dev05] can be obtained by choosing the input state par = &ar 
(where &ar is a maximally entangled state between A and R). Similarly, entanglement- 
assisted channel coding [BSST02] corresponds to the input choice pabr = ®A R R ® ^A B B 
(where %a = Ua r ® Ha b > with Ar containing the state to be transmitted and Ab the 
initial entanglement that Alice shares with Bob). Other choices of pabr correspond to 
other scenarios. 

Another application where decoupling can be employed as a building block for con- 
structing protocols is the simulation of noisy quantum channels using perfect classical 
channels together with pre-shared entanglement. The claim that this is possible using 
only a classical communication rate equal to the capacity of the channel to be simulated, 
is known as the Fully Quantum Reverse Shannon Theorem [BSST02, BDH+09]. In [BCR11], 
a proof of this theorem using the decoupling technique has been proposed. 
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A Properties of Smooth Entropies 



Lemma A.l. Let p AB e S^Hab)- Then, H 2 (A\B) p ^ H min (A\B) p . 

Proof. Let a B e S = (H B ) be such that p AB < 2- H ™^ A \ B ^t A <g> a B . We then obtain 

Jl A <g> u B y 1/2 p AB (t A ® w B ) _1/2 PAB 

'M ® OBY Xl2 pAB{^A ® CT B ) _1/2 PAB 



2 -H 2 (A\B) p =minTr 
^ Tr 



□ 



Lemma A.2 (Superadditivity of smooth min-entropy). Let e,e' > 0, p AB e 5= ("Has) 
/Z^, e S=(U A > B >). Then 

HS?(AA'\BB'U, > H^ n {A\B) p + H< m (A'\B') p , . 



Proof. Let p AB e B £ (p AB ) and /o^, s , e £ e (p^, B ,) such that H^ in (A\B) p = H min (A\B) p and 
H^ ia (A>\B>) p , = H mm (A'\B>) p , 

By the triangle inequality for the purified distance [TCR10, Lemma 5] we have p AB ® 
p' A i B i £ B £+£ (p A B <8> p'a'B')- Using the additivity of min-entropy [KRS09] we conclude 

-^mtn {AA'\BB') p ® p i > H min (AA'\BB') p!S)p -> = H min (A\B) p + H min (A'\B') p i 

= H mm( A \ B )p + H Lm( A '\ B ')p' • 

□ 

Lemma A.3. [TCR10, Lemma 20] Let p AB e S = {H AB ). Then H min (A\B) p > - log \B\. 
Lemma A.4. Let e ^ and p AB c £ <->=(% abc)- Then 

H^{AB\C) p H £ min (A\C) p + log \B\ . 



Proof. Let p AB c £ B £ {pabc), &c e S = (Hc) ar »d A e R such that 

flSun(AB|C) p = H mm (AB\C)- p = - log A , 

that is, A is minimal such that A • Iab ® <?c — PABC ^ 0. By taking the partial trace over 
B we get \ • \B\ ■ \ A ® ac — p A c ^ 0. Furthermore we have by the monotonicity of the 
purified distance [TCR10, Lemma 7] that p A c £ B £ (p A c) and hence 

H £ m ^{A\C) p > H min (A\C) p > -log/i , 

where /i e R is minimal such that p ■ 1 A (g) ere — p^c ^ 0. Thus A • |B| 5 s M an ^ therefore 

H^(AB\C) p H^(A\C) p + log \B\ . 

□ 
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Lemma A.5. Let p AB x £ S={U A bx) with p ABX = H x PxPab®\ x X x \x and p x AB e S = {U AB ) 
for all x. Then 

H mhi {A\BX) p = - log(J] p t ■ 2 - H ^ A We* ) . (17) 



Proof. By the operational interpretation of the min-entropy as the maximal achievable 
singlet fraction [KRS09, Theorem 2] we have 

H min {A\BX) p = -\og(\A\ • max F\(1 A ® T BX ^A')(pabx), \®)<$\aa>)) , 

where the maximum is taken over all TPCPMs Tbx^A', \&)AA' = |^4|~ 1//2 Yn \ x )a ® |^)a'/ 
and 7^/ s Ha- Writing out the min-entropy terms on the right hand side of (17) in the 
same manner we obtain 

H miQ (A\B) p * = -]og(\A\ ■ max F 2 ((1 A <8> ^ A ,)Wb), \*X*\aA')) ■ 

B—>A f 

The claim of the lemma is therefore equivalent to 

max F 2 ((1 A ® Fbx^A')(pabx), \$)<$\aa') 

•F BX^A' 

= £ iax f2 (^ ® ^-A'Xtfb), \®)<®\aa>) ■ 

X ^B^A' 

Now, because the state p AB x is classical on X, the maximization on the left hand side 
can without loss of generality be restricted to TPCPMs that first measure on X in the 
basis {|x>} and then do some TPCPM F' B ^ A , conditioned on the measurement outcome 
x. By the linearity of the square of the fidelity when one argument is pure, the claim then 
follows. □ 

Lemma A.6. Let p ABX e S=(U A bx) with pabx = HxPxP i AB®\ x ')( x \x and Pab e 5 =(X4b) 
for all x. Then 

H max (A\BX) p = log£> • 2 a ^ A \ B ^) . (18) 



Proof. Let pabcxx' be a purification of pabx ■ Then we have by the duality of conditional 
min- and max-entropy (Lemma 2.5) and a lemma about the conditional min-entropy of 
classical quantum state (Lemma A.5) that 

H max (A\BX) p = -H min (A\CX') p = log(J>x ■ 2-*W4<V) = log£>* • 2 ff »»<4 i V) . 

X X 

□ 

Lemma A.7 (Chain rule for smooth min-entropy). Let e > 0, e',e" > and pabc e 
S^-Habc)- Then 

H;^"(AB\C) P > H< n (A\BC) p + H< n {B\C) p - log | 
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Proof. Let p' ABC e B £ '{ PA bc) such that H< n {A\BC) p = H min (A\BC) p , and let p' ABCE be 
a purification of p' ABC - Furthermore let p" BC e B £ (p BC ), oc e S={Ubc) and A e R such 
that ff„ lin (i?|C)p = H m i n (B\C) p " = — log A, that is, A is minimal such that 

A • t B ® a c - p"bc > • (19) 
By [TRSS10, Lemma 21] there exists a projector Pae such that 

Pabce : = [Pae ® 1bc)p'abce( p ae ® Ibc) e B £ {p' ABCE ) , 

and 

a-^Ai^+iog £ . lA ^ _ ^ (20) 

Now let Tgc he defined as in Lemma B.2 with p" BC = Tbcp'bc^bc an< ^ consider the state 

Pabce : = (Iab ® Tbc^abce^ae ® r| c ) = (P^e ® T B c)pabce( p ae ® T BC ) . 
Applying Tbc to (20) we obtain 

2- B &*M BG >,+**$ . lA 9 p » BC _ Abc > o . 
Together with (19) this yields 

2 ~H^A\BC) p+ lo g ^; in (B\C) p . ^ a(j _ > Q 

This implies 

tf min (^|CV > F< n (A|BC) p + H< n (B\C) p - log | . (21) 

But by the monotonicity of the purified distance [TCR10, Lemma 7] and the definition of 
Tbc w e have 

P(PabciPabc) < P((Pae ® T bc )p'abce(Pae <S> r| a ), (P^ ® ^bc)p'abce{. p ae ® Isc)) 
«S ® T B c)p'abce(1ae (8) T BC ),p ABCE ) = P(p'bc p' BC ) , 

and hence 

?(^BC /W) < ^(Pbo Pbc) + P(pbcp'bc) < + • 
Finally we obtain 

P(PabcPabc) < P{p'abciPabc) + P(p'abciPabc) + P(p'abciPabc) 
^ e" + e' + e + e' = e + 2e' + e' , 

and thus together with (21) that 



A^-h^IC^ > fl^BC), + ^< n (^|C)p - log 3 



t- 1 

□ 
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Lemma A.8. Let pabc e S^(Habc) be a pure state. Then, for ob e S = (T~Lb) with full rank, 
we have that 

Pabc < Z A b ® lc, 



w/iere Zab = ^ Hm ^ A ^ B ^ pW o- B 1 ^ 2 y a^pAB^^^B 1 ^- Furthermore, Zab has the property 
thatTr[Z AB <T B ] = 2 fl ' M *( A l B W. 



Proof. Consider the following semidefinite program (for a introduction to semidefinite 
programs presented in this manner, see for instance [Wat08]): 

Primal 

Dual 

maximize: Tr[p AB cX A Bc] ... „ r/ „ w , 

,, ^ [v , ^ minimize: Tr[(l A <g> ct b )zW] 

subject to: TtcI^abcJ = ^ ® °b , . ^ 

subject to: p^BC < ® lc- 

-<*ABC 5= U. 



It is clear that the optimal value of the primal problem is 2^ max ^ j4 ' B ^l CT from Definition 2.9 
and Uhlmann's theorem. One can also easily show that strong duality holds (i.e. that the 
optimal value of the dual problem is equal to that of the primal problem): one simply 
needs to show that there exists a Zab such that Zab ® lc > Pabc, which holds for 

Zab = 21 ab- 

Now, we need to show that the optimal Zab for this problem has the form given in the 
lemma statement. First, note that by Uhlmann's theorem, there must exist an optimal 
Xabc which has rank 1, assuming we consider the system C to be large enough. Let 
Xabc = I^XvIabc an d let pabc = \pXp\abc, an d consider the complementary slack- 
ness condition for X and Z to be optimal: pabcXabc = {Zab <8> 1c)X A bc- We can 
rewrite this as 

(p\<p)\pX<f\ = (Zab ® 1 C )\<PX<P\ 

and therefore 

<P\<P>\P> = (Zab® tc)\<P), 

and 

F(p,v) 2 \p)(p\ = (Zab ® 1 c )\<pXv\(Z ab ® lc)- 
Tracing out C and using the fact that F(p, ip) 2 = 2 /fmax ( j4 l B )pk, we get 

2 H n ^(A\B) plapAB = Zab{1a g, aB )z AB . 

1/2 

Now, conjugating both sides by o B and taking square roots on both sides, we get that 

2l H ^\^afp AB af = afZABof. 

— 1/2 

If <jb has full rank, we get the expression for Zab by conjugating both sides by a B . 
Finally, the fact that Tt[Zab&b] = 2^ max ^' B ^i CT can simply be computed from the expres- 
sion for Z. 

□ 
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B Technical Lemmas 



Lemma B.l (Lemma 6 in [TCR10]). Let p, a e S^(H). Then, 



1 



D(p,a) sc P(p,a) < ^2D(p,a) < V2||p-^l|i 
P(p,a) 2 <£(p,a)<P(p,a). 



whereD(p,a) := §||p-(7||i + ±| Tr[p] - Tr[<r]|. 

Lemma B.2. Le£ pab £ S^(T-Lab) and a a £ 5^ ("Ha)- T/zen ^re exists Ta e £(Ha) with 

oab ■= (Ta <8> Is) pas 
an extension of a a such that P(pab,cab) = P(pa, o*a)- 



— — 

Proof. Define Xa := cr;|p A and polar decompose X^ = Va ■ (X A X4) 1//2 . Furthermore 
define Ta ■= <t\VaPa 2 > where the inverse is a generalized inverse. 10 We have 



Tr B ((T A ® 1b)pab(t1 ® 1b)) = T aP aT ] a = o\V A V\A = °A , 

which shows that oab = (Ta ® 1 b)pab(t\ ® 1_b) is an extension of a a- Thus it remains 
to prove that P(pab, <?ab) = P(pa, <ta)- 

For this we first assume that pab is pure and normalized, i.e., pab = \p)(p\ab £ S=(Hab)- 
Then we have 



P(PAB, tab) = V1-F 2 (pab,<tab) = Vl-|<Pk>l 2 = Vl- Kp\Ta® 1b\p)\ 2 



1 - |Tr [(T A <8> 1b)pabT = \l 1 - ^ (°aVaPa 12 ® 1b)pab 



Tr 



1/2 T/ 1/2 
^A VAPi 



Tr 



1/2 1/2 T/ 

Pa ^a Va 



Tr 



1/2 1/2 

Pa ^Pa 



VI-F^Pa^a) = P(pa, pa) ■ 



If PAB = |p)(p|ab is not normalized we obtain analogously 



P(pab,ct A b) = V 1 - [P(pab, OAs) + V( 1 - Trp A B)(l - Ttoab)] 2 



1 - [F(p A , a A ) + V(l "Tr pa) (1-Tr^)] 2 = P(p A , a A ) • 



The statement for a general pab (not necessarily pure) follows by the monotonicity of the 
purified distance [TCR10, Lemma 7] under partial trace. □ 



10 For M e V, M" 1 is a generalized inverse of M if MM' 1 = M~ 1 M = supp(M) = supp(M 1 ), where 
supp(.) denotes the support. 
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